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ABSTRACT 


A  procedure  is  griven  for  computing  the  analysis  of  variance  obtained  by  fitting 
constants  of  the  experimental  setup  where  repeated  measurements  are  made  on 
subjects  over  time  and  treatments  have  a  two-way  classification  with  disproportionate 
subclass  numbers.  Under  certain  assumptions,  this  analysis  gives  a  test  of  the  three- 
factor  interaction,  tests  of  two-factor  interactions  if  there  is  no  three-factor  interaction, 
suid  teats  of  main  effects  if  there  are  no  interactions.  The  advantage  of  tIo.s  procedure 
over  the  standard  procedure  of  fitting  constants  is  in  the  amount  of  J>r<ipi'!tation 
involved. 
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ANALYSIS  OF  REPEATED  MEASUREMEim  WITH  DISPROPORTIONAH 
SAMPLES  FOR  TREATMENTS  IN  A  TWO-WAY  CLASSIHaTION 


1.  INTRODUCTION 

In  the  analysis  of  data  from  a  repeated  measurement  design  in  which  the  treat¬ 
ments  are  arranged  in  a  two-way  classification  and  the  subclass  numbers  are  dispro¬ 
portionate,  difficulties  arise  similar  in  nature  to  those  encountered  in  the  analysis  of 
data  from  a  simple  two-way  classification  experiment  with  disproportionate  subclass 
numbers.  In  the  present  paper  the  computational  procedures  for  a  method  of  analysis 
for  this  setup  are  given  along  with  a  numerical  example.  The  theoretic  development  and 
justification  for  the  proposed  method  will  be  presented  in  a  later  paper. 

In  studying  the  effects  of  treatments  on  subjects  in  time,  it  is  sometimes  desirable 
to  design  a  repeated  measurements  experiment  with  a  two-way  classification  of  treat¬ 
ments.  For  example,  a  physiologist  may  be  interested  in  studying  the  effect  over  time 
of  certain  drugs  at  different  altitudes.  It  would  be  desirable  to  have  the  drugs  and 
altitudes  in  a  factorial  arrangement  so  as  to  be  able  to  determine  whether  or  not  the 
relative  effects  of  the  drugs  are  the  same  at  all  altitudes.  As  another  example,  the 
experimenter  may  be  interested  in  wliether  males  and  females  respond  the  same  to  dif¬ 
ferent  levels  of  a  treatment.  In  this  case  sex  becomes  the  second  way  of  classification. 

Disproportionality  of  the  numbers  of  experimental  units  in  such  setups  can  occur 
for  many  reasons.  It  can  be  present  from  the  beginning  in  experiments  when  a  suf¬ 
ficient  number  of  appropriate  experimental  units  is  not  available.  Further,  even  if  an 
experiment  is  initially  set  up  with  proportionate  (which  includes  equal)  allocations,  the 
experimenter  frequently  ends  up  with  disproportionate  numbers  for  a  variety  of  rea¬ 
sons.  For  example,  subjects  may  be  dropped  from  the  experiment  owing  to  a  sickness 
which  is  completely  unrelated  to  the  effects  of  the  treatments,  or  part  of  the  expsri- 
mental  material  may  be  accidentally  lost.  When  subjects  are  not  proportionately 
allocated,  the  computation  of  the  sums  of  squares  in  the  analysis  of  variance  is  deter¬ 
mined  by  which,  if  any,  of  the  interactions  can  be  assumed  zero,  just  as  in  the  case  of 
the  two-way  classification  experiment  with  disproportionate  subclass  numbers. 

Under  the  symmetry  assumption,  which  will  be  explained  below,  the  procedure  pre¬ 
sented  herein  will  give  a  test  for  the  three-factor  interaction.  If  the  three-factor 
interaction  is  zero,  it  then  provides  tests  for  the  two-factor  interactions.  Finally,  if  all 
interactions  are  zero,  it  provides  tests  for  the  main  effects.  This  analysis  is  an  ex¬ 
tension  of  the  method  for  the  two-way  classification  frequently  referred  to  as  the 
“Method  of  Fitting  Constants"  (1). 

The  symmetry  assumption  states  that  the  observations  for  each  subject  are  equally 
correlated  in  time  and  the  variances  at  all  times  are  equal.  This  assumption  makes 
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the  study  of  the  distributions  for  the  usual  test  statistics  analytically  tractable.  See 
Hughes  and  Danford  (2)  for  an  exposition  of  this  requirement  for  the  repeated 
measurements  case  when  treatments  have  a  single  classification.  When  the  symmetry 
assumption  cannot  be  made  the  only  method  of  complete  analysis  available  at  the 
present  for  the  repeated  measurements  design  with  singly  classified  treatments  is  a 
n  'Itivariate  procedure  (3).  The  procedure  proposed  herein  is  valid  only  for  the  sym¬ 
metry  assumption.  * 


2.  MODEL  EQUATION 

For  the  experimental  setup  considered  in  this  paper,  the  model  equation  is : 

^Ijkiii  =  M  +  “|  +  +  “«ij  +  7|5  +  “7|k  +  pyjk  +  ■ 

In  the  equation  given  above,  Greek  letters  stand  for  fixed  effects  and  Latin  letters 
for  random  effects,  with: 

li  =  the  overall  mean 

a,  =  the  effect  of  the  i"'  level  of  factor  A  (i  =  1, .  .  .  r) 

=  the  effect  of  the  j"'  level  of  factor  B  (j  =  1, .  .  .  q) 

=  the  effect  of  the  k"i  time  (k  =  1, .  .  .  p) 

=  the  interaction  effect  of  the  i">' level  of  factor  A  and  the  j"‘  level  of  factor  B 
07,1,  =  the  interaction  effect  of  the  i"'  level  of  factor  A  and  the  k*'‘  time 
=  the  interaction  effect  of  the  j"'  level  of  factor  B  and  the  kl**  time 

=  ^he  interaction  effect  of  the  i">  level  of  factor  A,  the  level  of  factor  B,  and  the 
k">  time 

*iini  =  effect  of  the  m“‘  subject  in  the  i,  j'*‘  group  (m  =  1,  . .  .  n,j)  with  the  assumptions: 

E  (8ij,„)  =  0 

E  (Sijm.  i  =  i',  j  =  j',  m  =  m 

(0;  otherwise 

C^ijkm  =  random  error  term  plus  the  inseparable  (for  this  design)  interaction  effect  of  the 
m"<  subject  in  the  i,  j"*  group  and  the  k"*  time,  with  the  assumptions: 

E  (gijk.„)  =  0 

E  (gijk,„.  Ki-j-kw)  =  i=  j  =  j',  k  =  k',  m  =  m' 

(0;  otherwise 

Finally,  for  purposes  of  testing  later  on,  it  is  assumed  that  the  Sijm  and  gijkm  are 
normally  distributed  and  the  Sij„  are  independent  of  the  gijk„  for  all  i,  j,  k,  m.  Writing 
the  model  with  a  random  subject  effect  and  uncorrelated  error  implies  that  the  ob¬ 
servations  for  a  subject  are  equally  correlated  in  time.  This,  with  the  assumption  that 
the  variances  at  all  times  are  equal,  which  is  implied  in  the  assumptions  on  the  Sym 
&nd  gijkmt  is  the  symmetry  assumption. 

3.  ANALYSIS 

Certain  sums  of  squares  are  unaffected  by  the  disproportionality  and  take  the  usual 
form.  These  are: 

SS(time)  =  2  Y  ^  ./n .  .  -  Y  ?. .  ./pn . . 


2 


SS  (Error  (a))  =  SS  (Subjecta/grroupa) 

=:  Y;1  .  „,/p  -  22  Y,^  ./pn„ 

1  j  n.  I  J 

SS  (Error  (b))  =  SS  (Subject  x  time/groups) 

=  2:222  Y^jk,,,  -  222  yL  ./n,.  -  SS  (Error  (a)) 

I  J  k  ni  I  J  k 

where  n  . .  =  SS  iiij  and  Y  . .  w .  =  SSS  Yiji„„  ;  i.e.,  the  dot  indicates  summation 

i  J  I J  III  • 

over  the  index  it  replaces.  The  other  sums  of  squares  required  in  the  analysis  of 
variance  can  be  most  clearly  shown  by  using  matrix  notation  in  part.  For  that  purpose 
the  quantities  C  and  Gk  '’  will  be  utilized,  where  r'  =  r  —  1,  q'  =  q  —  1,  and  Gk’^'  is  the 
trarispose  of  Gk.  Using  this  notation,  the  sums  of  squares  in  the  analysis  of  variance 
can  be  computed  as  shown  in  table  I,  where  again  the  dot  indicates  summation  over 
the  index  it  replaces ;  e.g.,  G.'*’  =  S  Gk'*'.  Thus,  in  order  to  compute  the  sums  of  squares 

k 

it  is  only  necessary  to  calculate  certain  standard  sums  of  squares,  obtain  the  inverse 
of  a  matrix  of  order  r  -f  q  —  1,  and  perform  some  simple  matrix  multiplications. 
The  above  procedure  for  computing  sums  of  squares  is  equivalent  to  fitting  constants, 
but  affords  simpler  computational  forms. 
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TABLE  I 


Sums  of  squares  and  degrees  of  freedom  for  the 
analysis  of  variance 


Source 

d.f. 

S.  Sq. 

A 

r  -  1 

r 

G  .  C  '  G  . /p  -  i:  Yi-j .  . /pn  .  J  =  (1) 
j 

T 

B 

0  -  1 

a.C“'c:./p  -  lYp..  /pn,.  =  (2) 

1 

A  X  B 

1)  (q  -  1) 

--  Yfj . .  /pn,|  -  G  .  C  '  ‘g  .  /p  =  (3) 

•  j 

Error  (a) 

n  . .  —  ifi 

:  -  :vvv.^:../pn..  :zz(4> 

5  j  IM  i  j 

Time 

p  - 1 

Y®.  .  ^  .  /n  .  .  -  Y'  .  .  .  /pn  .  . 

k 

A  X  t 

(r  —  1)  (p  _  1) 

i'GiJc-'G,  -  irY-;j,./n.j-  (1) 

k  J  k 

• 

B  X  t 

(q  -  1)  (p  -  1) 

'-G'C-^G^  -  IS  Y'.,./n,.  -  (2) 

k  i  k 

A  X  B  X  t 

(r  -  1)  (q  -  1)  (p  -1) 

SSS  Y®,  . /n,,  -  vg',C  -'g,  -  (3) 

•  J  k  W 

• 

vwv  Y*  __  VVV  Y*^  /n  —  (A\ 

i  J  w  Ml  1  j  k 

Error  (b) 

(n  .  .  -  rq)  (p  -  1) 

Total 

pn  . .  —  1 

9 

'Not  Riven  Mtncc  in<iivi«l«i}il  .sums  of  sfiuniX'H  ilo  mil  Kum  lo  totiil  sum  of  siiuni'cs. 


Under  the  assumptions  given  in  the  model  statement,  the  Error  (a)  mean  square 
is  the  appropriate  mean  square  for  testing  the  effects  “above  the  line”  and  Error  (b) 
mean  square  is  used  for  testing  the  effects  “below  the  line.”  In  this  analysis,  the  sum 
of  squares  for  the  three-factor  interaction  is  adjusted  for  all  two-factor  interactions 
and  main  effects,  the  sum  of  squares  for  each  two-factor  interaction  is  adjusted  for  all 
other  two-factor  interactions  and  all  main  effects,  and  the  sum  of  squares  for  each  main 
effect  is  adjusted  for  all  other  main  effects.  However,  the  sums  of  squares  for  the 
two-factor  interactions  are  not  adjusted  for  the  three-factor  interaction  and  the  sums 
of  squares  for  the  main  effects  are  not  adjusted  for  any  interactions.  Therefore,  if 
the  three-factor  interaction  is  significant,  the  main  effects  and  two-factor  interactions 
should  be  looked  at  in  two-way  tables  at  each  level  of  the  third  factor  and  if  there  are 
significant  first-order  interactions,  the  main  effects  of  the  factors  in  the  significant 
interaction  should  be  looked  at  separately  at  each  level  of  the  other  factor.  If  overall 
tests  of  significance  for  main  effects  or  two-factor  interactions  are  desired  in  the  above- 
named  cases,  then  a  different  analysis  should  be  performed. 
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4.  EXAMPLE 


The  basic  data  for  the  example  are  given  in  table  II.  There  are  three  levels  of 
factor  A,  two  levels  of  factor  B,  and  three  measurements  made  over  time,  with  the  time 
points  spaced  the  same  for  all  subjects.  The  number  of  subjects  for  each  treatment 
combination,  ny,  is  given  in  table  III.  These  are  constant  over  time;  i.e.,  p  =  3 
observations  were  made  over  time  on  each  subject. 


TABLE  II 


Basic  data  for  the  example 


Treatment 

Treatment 

Time 

A 

B 

1 

2 

3 

,39 

36 

38 

1 

38 

42 

30 

40 

28 

33 

\ 

36 

38 

31 

28 

2 

46 

44 

35 

44 

41 

36 

32 

34 

33 

56 

51 

53 

54 

40 

45 

1 

60 

46 

49 

32 

33 

31 

2 

39 

34 

32 

2 

54 

56 

50 

• 

^■jum 

38 

41 

37 

39 

33 

51 

38 

39 

3 

55 

48 

46 

39 

44* 

2 

49 

36 

42 

44 

39 

22 

TABLE  III 

The  number  of  subjects  luith  each 
treatment  combination  for  the  example 


Treatment 

A 

Treatment  B 

Total 

1 

2 

1 

3 

8 

2 

4 

6 

3 

3 

H 

7 

Total 

10 

11 

21 

6 


The  C  matrix  is  determined  from  table  III ; 


n  . . 

n,  . 

. 

n . , 

■ 

Yi 

8 

6 

10 

n,  . 

n,  . 

0 

"n 

8 

8 

0 

3 

iio  . 

0 

n.^  . 

6 

0 

6 

4 

n  M 

n,., 

"11 

n., 

• 

10 

3 

4 

10 

mm 

Its  inverse  is: 


C-'  = 


1  .180109286 
-.147521161 
-.122128174 
-  .087001669 


-.147521161 

.268440145 

.140266022 

.010882709 


The  Gk  are  determined  from  table  II: 


.122128174 

.140206022 

.321039903 

.048367593 


-.087061669 

.010882709 

-.048367593 

.203143894 


906 

836 

787 

306 

;  G.  = 

298 

;  G,  = 

271 

285 

259 

260 

437 

.387 

395 

From  these  quantities,  calculate: 


Gji'C-'G,  =  39,418.54 
G^,>C  'G...  =  33,459.44 
G/C-  'G.,  =  29,830.74 
G  .'fC-  'G  .  /p  =  102,223.80 
:2  Gk  'C-'G^  =  102,708.72 


Furthermore : 


2:  vf. . .  /pnj  .  =  102,217.80 


2  Y"*.  j . .  /pn  .  j  =  101,535.06 
S  S  Y*  . .  /pn,j  =  102,236.43 
2:  Y^. .  ^  .  /n  . .  =  101,861.95 


2:  2:  Y*.  k  .  /n, .  =  102,626  89 

I  k 


2  2:  y“  .  .  /n .  ,  =  101,969.57 
J  k 

2  2  S  Y,jk .  /n,.  =  102,761.92 
I  J  k 
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n  S  V ij  .  ,,,/p  =:  104.187.00 
I  J  III 

S  i:  S  2:  yL,,,  =  106.277.00 
1  J  k  III 

Y*. ...  /pn..  =:  101.521.29 

The  completed  analysis  of  variance  for  the  example  is  given  in  table  IV. 


TABLE  IV 

Analysis  of  variance  for  the  example 


Source 

d.f. 

S.  Sq. 

[  M.  Sq. 

F 

P 

A 

2 

688.74 

2.65 

NS 

B 

1 

6.00 

.05 

NS 

A  X  B 

2 

12.03 

.05 

NS 

Error  (a) 

15 

1,950.57 

130.04 

Time 

2 

.340.66 

170.33 

9.05 

A  X  t 

4 

50.41 

12.60 

.67 

NS 

B  X  t 

2 

75.83 

37.92 

2.01 

NS 

A  X  B  X  t 

4 

40.57 

10.14 

.54 

NS 

Error  (b) 

30 

564.51 

18.82 

Total 

62 

In  the  example,  all  of  the  probability  levels  for  the  interactions  are  greater  than  .10. 
Therefore,  it  seems  reasonable  to  assume  that  all  of  the  interactions  are  zero.  (It 
would  seem  desirable  to  use  a  probability  level  greater  than  .05  to  test  this  assump¬ 
tion.)  Under  this  assumption,  the  F-ratios  for  main  effects  give  valid  tests  of  the  main 
effects.  That  is,  the  F-ratios  give  tests  of  main  effects  only,  rather  than  main  effects 
plus  interactions  as  they  would  if  interactions  were  not  zero.  From  this  analysis,  there 
is  no  indication  of  an  effect  due  to  either  factor  A  or  factor  B.  However,  there  is  a 
highly  significant  time  effect. 

Even  in  an  experiment  as  small  as  this  example,  the  advantage  of  this  procedure  is 
apparent.  One  of  the  several  matrices  to  be  inverted  using  the  standard  procedure  of 
fitting  constants  would  be  of  order  12,  whereas  with  this  procedure  only  one  matrix 
of  order  4  need  be  inverted. 
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